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Abstract 

A lower bound is presented which shows that a class of heap algorithms in the pointer model 
with only heap pointers must spend Q ( lo'g lo^g n ) amortized time on the Decrease-Key op- 
eration (given O(logn) amortized-time Extract-Min). Intuitively, this bound shows the key 
to having 0(l)-time Decrease-Key is the ability to sort O(logn) items in O(logn) time; Fi- 
bonacci heaps [M. .L. Fredman and R. E. Tarjan. J. ACM 34(3):596-615 (1987)] do this through 
the use of bucket sort. Our lower bound also holds no matter how much data is augmented; this is 
in contrast to the lower bound of Fredman [J. ACM 46(4):473-501 (1999)] who showed a tradeoff 
between the number of augmented bits and the amortized cost of Decrease-Key. A new heap 
data structure, the sort heap, is presented. This heap is a simplification of the heap of Elmasry 
[SODA 2009: 471-476] and shares with it a O(loglogn) amortized-time Decrease-Key, but 
with a straightforward implementation such that our lower bound holds. Thus a natural model 
is presented for a pointer-based heap such that the amortized runtime of a self-adjusting struc- 
ture and amortized lower asymptotic bounds for Decrease-Key differ by but a O(logloglogn) 
factor. 
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1 Introduction 



While Insert and Extract-Min are supported by all priority queues, there is one additional oper- 
ation which is of use in some algorithms: Decrease-Key. The fast execution of Decrease-Key is 
vital to the runtime of several algorithms, most notably Dijkstra's algorithm [Dij59j for single-source 
shortest paths in a graph. The constant-amortized-time implementation of the Decrease-Key 
operation is the defining feature Fibonacci heap [FT87| data structure. 

In |FSST86] . a new heap called the pairing heap was introduced. The pairing heap is a self- 
adjusting heap, whose design and basic analysis closely follows that of splay trees [ST85| . They are 
much simpler in design than Fibonacci heaps, and they perform well in practice. They were included 
in the (pre-STL) C++ data structures library. We have shown that they have a working-set like 
runtime bound [IacOOj . 

It was conjectured at the time of their original presentation that pairing heaps had the same 
O(logra) amortized time Extract-Min, and 0(1) amortized time Insert and Decrease-Key 
as Fibonacci heaps, however, at their inception only a O(logra) amortized bound was proven for 
all three operations. Stasko and Vitter [SV87] provided some simulation results which showed 
that O(l) Decrease-Key appeared likely. We have shown that Insert does in fact have 0(1) 
amortized time [Iaclll HacOOj . 

However in |Fre99b| . Fredman refuted the conjectured constant-amortized-time Decrease-Key 
in pairing heaps by proving that pairing heaps have a lower bound of r2(log log n) on the Decrease-Key 
operation. The result he proved was actually more general: he created a model of heaps which in- 
cludes both pairing heaps and Fibonacci heaps, and produced a tradeoff between the number of bits 
of data augmented and a lower bound on the runtime of Decrease-Key. Pairing heaps have no 
augmented bits, and were shown to have a f2(loglogn) amortized lower bound on Decrease-Key 
while he showed that in his model a 0(1) Decrease-Key requires f2(loglogn) bits of augmented 
information per node, which is the number of bits of augmented information used by Fibonacci 
heaps and variants (A number called rank, which is an integer with logarithmic range is stored in 
each node). More recently, Pettie has shown a upper bound of 2°( v/loglogn * > for Decrease-Key in 
pairing heaps |Pet05j . It remains open where in the range f2(loglogn) . . . 2°(v / i°g logn) true cost 
of Decrease-Key in a pairing heap lies. 

Elmasry has shown recently that a simple variant of pairing heaps has O(loglogn) amortized 
time Decrease-Key operation |ElmlOa[ IElm09j . However, for technical reasons described later, 
including a non-standard implementation of Decrease-Key, this variant is not in Fredman's model 
and thus the O(loglogn) lower bound does not apply. 

So, it would seem from the preceding exposition that the situation has been essentially resolved: 
Fibonacci heaps are complex but optimal, while the elegant pairing heaps (and Elmasry's variant) 
are as good as a self-adjusting structure can get (One defining feature of a self-adjusting structure 
is that they store no augmented data in every node). In the case of dictionaries, we have the hope 
of instance-based optimality, as evidenced by the dynamic optimality conjecture [ST85] . while in 
the case of heaps, self-adjusting structures can not even achieve optimal amortized asymptotic 
runtimes. We will now propose an alternate interpretation of the facts which leads to a much nicer 
conclusion. 

Fredman's model is nuanced, and has limitations which cause us to introduce here a new model, 
which we call the pure heap model. We first informally describe our model, and then describe how 
it differs from Fredman's. A pure heap is a pointer-based forest of rooted trees, each holding one 
key, that obeys the heap properly (the key of the source of every pointer is smaller than the key 
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Table 1: Comparison of various heaps, giving their amortized runtimes per round, where a round consists of one Insert, logn 
Decrease-Key operations, and one Extract-Min on a heap of size n. Tightness of analysis is the ratio of the best upper bound 
to the best lower bound. 



of the destination); the nodes may be augmented, and all operations must be valid in the pointer 
model; heap pointers are only removed if one of the nodes is removed or if a Decrease-Key is 
performed on the node that a heap pointer points to. This model is both simple and captures the 
spirit of many heaps like the pairing heap. However, Fibonacci heaps are not pure model heaps 
in the standard textbook presentation [CLRS09] for the following reason: each node is augmented 
with a loglogn-bit integer (in the range 1 . . .logn), and in the implementation of Extract-Min 
it is required to separate a number of nodes, call it k, into groups of nodes with like number. This 
is done classically using bucket sort in time 0(k + logn). However, bucket sort using indirect 
addressing to access each bucket is distinctly not allowed in the pointer model; realizing this, there 
is a note in the original paper [FT87] showing how by adding another pointer from every node to a 
node representing its bucket, bucket sort can be be simulated at no additional cost. However, these 
pointers are non-heap pointers to nodes which store no keys and thus violate th e tree requirement 
of the pure heap model. 

There have been several alternatives to Fibonacci heaps presented with the same amortized 
runtimes which are claimed to be simpler than the original. These include thin heaps [KT08] 
violation heaps [Elm 10b] . and rank-pairing heaps [HSTllj . All of these heaps also use indirect 
addressing or non-heap pointers. The rank-pairing heap has the cleanest implementation of all of 
them, and implements Decrease-Key by simply disconnecting the node from it parents and not 
employing anything more complicated like the cascading cuts of Fibonacci heaps. 

We can easily modify Fibonacci heaps and the aforementioned alternatives to only use heap 
pointers by simply using O(loglogn) time to determine which of the O(logn) buckets each of the 
k keys are in. We call such a variant of rank pairing heaps a pointer Rank-Pairing heap, and list 
in in Table [TJ However, this alteration has the effect of increasing the time of Decrease-Key in 
a Fibonacci heaps and their variants to O (log logn). 

Fredman's model differs from ours in several regards. First, Fredman's model requires that 
comparisons can not be performed unless the nodes being compared must be linked by a heap 
pointer after the comparison. Second, in the course of an Extract-Min, any two children of the 
former root can be randomly accessed and compared at unit cost. Third, the number of augmented 
bits per node is a parameter of the model. The first restriction, while it admits pairing heaps and 
Fibonacci heaps, excludes Elmasry's variant. This is because Elmasry's variant sorts the keys in 
nodes in order to determine how to link them; such sorting is directly against what is allowed in 
Fredman's model. Our sort heaps presented in section [6] are not in Fredman's model for the same 
reason. Also, subjectively, we find the first restriction a bit odd, but it appears to be vital to the 
result. The second difference means that a fundamental cost in the pointer model is not counted: 
moving pointers to reach the desired nodes to compare or otherwise manipulate. So, compared to 
our model, Fredman's model is more restrictive because of the first condition, and more permissive 
with the second condition. We feel our model is more natural. 

Thus, we conclude that the reason that Fibonacci heaps have fast Decrease-Key is not (only) 
because of the augmented bits as Fredman's result suggests, but rather because they depart from 
the pure heap model. We prove that any pure-heap-model heap has an £1 (j^^^^j amortized 
lower bound on decrease key, no matter how many bits of data each node is augmented with. Such 
a bound is more than a mere technicality; it would elevate the status of a pairing heap variant with 
0(log log n) amortized time Decrease-Key to something conjectured to be amortized optimal and 
proved to be within an 0(log log logn) factor of being an amortized asymptotical optimal structure. 
In our view, Fibonacci heaps are a typical RAM-model structure that squeezed out a log log n factor 
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over the best structure in a natural pointer-based model by beating the sorting bound using bit 
tricks (of which bucket sort is a very primitive example) . 

Given our lower bound, we still have the issue that it does not apply to any known self-adjusting 
heaps with fast Decrease-Key times. We rectify this in by introducing the sort heap. This 
heap is simply Elmasry's heap with the non-standard Decrease-Key replaced with the stan- 
dard one, or to put in another way, it is identical to the pairing heap except for the choice 
of pairings used to implement the Extract-Min operation. Our sort heap has an O(loglogn) 
amortized Decrease-Key operation, and features an analysis that differers markedly from other 
self-adjusting heaps with fast Decrease-Key. 

The paper is structured as follows: we begin by reviewing the priority queue ADT in and 
then formally present our pure heap model ($3]) followed by a Fredman's generalized pairing heap 
model for comparison (§H). The main result, our lower bound, is presented in £}5j and this is followed 
by sort heaps which are presented in Finally, we close in £}7]with some thoughts and directions 
for further work. 

2 The priority queue abstract data type 

A priority queue supports the following operations to maintain a totally ordered set S: 

• p =Insert(x): Inserts the key x into S and returns a pointer p used to perform future 
Decrease-Key operations on this key. 

• x =Extract-Min(): Removes the minimum item in S from S and returns it. 

• Decrease-Key(j>, Ax): Reduces the key value pointed to by p by some non-negative amount 
Ax. 

The key values can be in any form so long as they are totally ordered and an 0(l)-time com- 
parison function is provided. (This is more permissive than saying they they are comparison-based. 
We do not restrict algorithms from doing things like making decisions based on individual bits of a 
key). Some priority queues, including Fibonacci heaps and our sort heaps, also efficiently support 
the Meld operation where two priority queues are combined into one. 

3 The pure heap model 

Here we define the pure heap model, and how priority queue operations on data structures in this 
model are executed in it. 

3.1 Structural invariants and terminology 

The pure heap model requires that at the end of every operation, the data structure is an ordered 
forest of general heaps. (Equivalently, the data structure is a half-ordered tree.) Each node is 
associated with a key x £ S. We will use x both to refer to a key and the node in the heap 
containing the key. Inside each node is stored the key value, pointers to the parent, leftmost child 
and right sibling of the node, along with other possible augmented information. 

The structure of the heap is the shape of the forest, without regard to the contents of the nodes. 
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The location of a node is its position in the forest of heaps relative to the right (i.e. a node could 
be described as being the fifth child from the right of a node which is the third child from the right 
of the fourth root from the right. The idea is that position is invariant under adding new siblings 
to the left). 

An algorithm in the pure heap model implements the priority queue operations as follows: 

• Insert operations are executed by adding the new item as a new leftmost heap. The cost is 
defined to be 1. 

• Decrease-Key is executed by disconnecting from its parent the node containing the key to 
be decreased (if it is not a root), decreasing the key, and then placing it as the leftmost root 
in the forest of heaps. The cost is defined to be 1. 

• A Extract-Min operation is performed by first executing a sequence of pointer-based subop- 
erations which are fully described below in H3.21 After executing the suboperations, the forest 
is required to be monoarboral (i.e. have only one heap). Thus, the root of this single tree has 
as its key the minimum key in S. This node is then removed, the key value is returned, and 
its children become the new roots of the forest. The cost is the number of sub-operations 
performed. 

Note that some data structures are not presented exactly in the framework as described above 
but can be easily put into this mode by being lazy. For example, in a pairing heap, the normal 
presentations of Insert and Decrease-Key cause an immediate pairing with the single existing 
root. However, such pairings can easily be deferred until the next Extract-Min, thus putting the 
resulting structure in our pure-heap framework. For a more elaborate example of transforming a 
heap into one based on pairings, see |Fre99a] . 

3.2 Executing a Extract-Min; suboperations 

Obviously, in order to execute an Extract-Min, the minimum must be determined. While at 
the beginning of the Extract-Min operation, the structure of the heap may consist of a forest 
of many heap-ordered trees, the pure-heap model requires that these trees be combined into one 
tree through a process called pairing. The pairing operation takes two roots and attaches the root 
with larger key value as the leftmost child of the root with smaller key value. (Note that while the 
pairing operation brings to mind pairing heaps, it is the fundamental building block of many heaps. 
Even the skew heap |ST86| . which seems at first glance to not use anything that looks like the 
pairing operation, can be shown in all instances to be able to be transformed into a pairing-based 
structure at no decrease in cost |Fre99aj ) . We require this process to happen in the pointer model, 
where there is some constant number of pointers that start at the leftmost root and move around 
and perform pairings. For the purposes of the analysis, it is needed to have a very fine view of what 
constitutes a constant-time suboperation, but, what is presented below is constant-time-equivalent 
to other natural ways of having a pointer model view with the basic primitive being pairing of the 
roots. 

In the execution of the extract-min operation, the use of some constant number p of pointers 
Pi,P2, ■ ■ -Pp is allowed. They are all initially set to the leftmost root. The constant p is a parameter 
of the model. These are the suboperations that are allowed to execute the Extract-Min operation: 

1. HasParent(z): Return if the node pointed to by pi has a parent. 
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2. HAsLEFTSiBLiNG(i): Return if the node pointed to by pi has a left sibling. 

3. HasRightSibling(z): Return if the node pointed to by pi has a right sibling. 

4. HASCHlLDREN(i): Return if the node pointed to by pi has any children. 

5. Compare^, j): Return if the key value in the node pointed to by pi is less than or equal to 
the key value in the node pointed to by pj . 

6. Pair(z, j): Perform a pairing on two pointers pi and pj where the tree that pj points to is 
attached to pi as its leftmost subtree. It is a required precondition of this sub-operation that 
both pi and pj point to roots, and that this was verified by the HasParent suboperation, 
and that the key value in the node pointed to to pi is smaller than the key value in the node 
pointed to by pj, and that is was verified by the Compare suboperation. 

7. Set(«, j): Set pointer pi to point to the same node as pj. 

8. MoveToParent(z): Move a pointer pi to the parent of the node currently pointed to. It is 
a precondition of this operation that the node that pi points to has a parent, and that this 
was verified by the HasParent operation. 

9. MoveToLeftmostChild (i) : Move the pointer pi to the leftmost child. It is a precondition 
of this operation that the node that pi points to has children, and that this was verified by 
the HasChildren operation. 

10. MovEToRiGHTSiBLiNG(i): Move the pointer pi to the sibling to the right. It is a precondi- 
tion of this operation that the node that pi points to has a right sibling, and that this was 
verified by the HasRightSibling operation. 

11. MoveToLeftSibling(z): Move a pointer pi to the sibling to the left It is a required precon- 
dition of this operation that the node that pi points to has a left sibling, and that this was 
verified by the HasLeftSibling operation. 

12. End(): Marks the end of the suboperation sequence for a particular Extract-Min. It is 
a required precondition of this operation that the forest of heaps contains only one heap, 
and that this was verified through the use of the HasParent(z), HAsLEFTSlBLlNG(i), and 
HasRightSibling (i) on a pointer pi that points to the unique root. 

Operations Q3H] return a boolean; the remainder have no return value. 

The total number of suboperations is defined to be rj. Observe that 7] = 0(p 2 ), which is 0(1) 
since p is a constant. 

A sequence of suboperations is a valid implementation of the Extract-Min operation if all 
the preconditions of each suboperation are met and the last suboperation is an end(). 

In the pure heap model, the only thing that differentiates between different algorithms is in 
the choice of the suboperations to execute Extract-Min operations. In these operations it is 
the role of the particular heap algorithm to specify which suboperations should be performed for 
each Extract-Min. We place no restrictions other than determinism as to how an algorithm 
determines the sub-operation sequence for each Extract-Min: the sub-operation sequence must 
be valid. 
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This definition of an algorithm encompasses and is more permissive than allowing the algorithm 
to make decisions to be made based on some data augmented at any node. This is because aug- 
mented information is just one type of function of the previous operations. We also note that while 
the definition of the pairing operation enforces the heap structure, where every parent is smaller 
than its children, the algorithm is not restricted from looking at, for example, the individual bits 
of a key and deciding which pairings to perform based on this. 

To summarize, our notion of an algorithm allows the algorithm, at every step of determining 
which sub-operation to execute next, to make decisions in any manner — even randomization is 
allowed. These computations to determine which sub-operation to execute next have no cost; only 
the execution of the sub-operations themselves incurs a cost. 

4 Fredman's model 

Fredman's model, which he calls generalized pairing heaps differ from our pure heap model in a 
number of significant aspects: 

• Generalized pairing heaps are parametrized by the number of bits of augmented data allowed 
at each node. Pure heaps allow an algorithm to branch as any function of the past; this is 
equivalent to allowing unlimited augmented data. 

• Generalized pairing heaps limit how performing a comparison can be done by the algorithm. 
In the process of executing an Extract- MlN operation, suppose a comparison is performed 
between two nodes, neither of which will be removed by the operation. This limits the 
number of comparisons to be performed in a ExTRACT-MlN to be linear in the number of 
pairings performed. In a generalized pairing heap, the result of this comparison can not be 
used to determine the future actions of the algorithm. In pure heaps, the result of such a 
comparison can be used. It is this crucial difference that places Elmasry's variant of the 
pairing heap and our sort heap in the pure heap model, but not in the generalized pairing 
heap model. These algorithms perform Decrease-Key operations on selected roots after 
performing comparisons to sort them; sorting roots is out-of-model in the generalized pairing 
heap model; this is easy to see because sorting requires doing a super-linenar number of 
comparisons. 

• Generalized pairing heaps do not take into account the time needed to access the items that 
are to be paired; arbitrary pairing of roots is allowed at unit cost. In the pure heap model, 
one needs to move pointers to the nodes to be paired using pointer-model-operations on the 
heap, and these operations must be paid for. 

5 Lower Bound 

5.1 Statement of result 

Theorem 1. In the pure heap model with a constant number p pointers, if Extract-Min and 
Insert have an amortized cost o/O (log n), then Decrease-Key has an amortized cost offl( r-j# 



The proof will follow by contradiction and will consume the rest of this section. Assume that 
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there is a pure heap model algorithm A where Extract-Min and Insert have an amortized cost 
of at most clogn, for some constant c, and Decrease-Key has an amortized cost of at most 
dc{n), for some dc{n) — o ( lo^f^^g n J j since this is a lower bound we can also safely assume that 
dc(n) = oo(l). The existence of the algorithm A, the constant c and the function dc(n) will be 
assumed in the definitions and lemmas that follow. A sufficiently large n is also assumed, as there 
are several places noted in what follows, such as using the results of asymptotic expressions, where 
this is required. 

5.2 Overview of proof 

The proof is at its core an adversary argument. Such arguments look at what the algorithm has 
done and then decide what operations to do next in order to guarantee a high runtime. But, our 
argument is not straightforward as it works on sets of sequences of operations rather than a single 
operation sequence. There is a hierarchy of things we manipulate in our argument: 

Suboperaton. The suboperations of N3.2I are the very basic unit-cost primitives that can be used 
to implement Extract-Min, the only operation that does not have constant actual cost. It 
is at this level that definitions have been made to enforce pointer model limitations. 

Operation. We use operation to refer to a priority queue operation. In this proof, the adversary 
will only use Insert, Decrease-Key, and Extract-Min. 

Sequence. Operations are combined to form sequences of operations. 

Set of operation sequences. Our adversary does not just work with a single operation sequence 
but rather with sets of operation sequences. These sets are defined to have certain invariants 
that bound the size of the sets of operation sequences under consideration. 

Evolutions. We use the word evolution to refer to a function the adversary uses to take a set of 
operation sequences, and modify it. The modifications performed are to add operations to 
sequences, remove sequences, and to create more sequences by taking a single sequence and 
adding different operations to the end. 

Rounds. Our evolutions are structured into rounds. 

The proof will start with a set containing a single operation sequence, and then perform rounds 
of evolutions on this set; the exact choice of evolutions to perform will depend on how the algorithm 
executes the sequences of operations in the set. The evolutions in a round are structured in such 
a way that most rounds increase the size of the set of operations. After sufficiently many rounds, 
the invariant on the maximum size of the set of operation sequences will be violated, thus giving a 
contradiction. 

Our presentation is structured as follows: In §5.31 we define a rank function. In E j5.4M5.5l we 
give some invariants and facts about the sequences of operations we will be considering. In §5.6.11 
we introduce the idea of a set of sequences and explain the invariants of the sets that will be 
maintained. We introduce the idea of an evolution in §5.6.21 and then describe several types of 
evolutions in §5.6.3115.6.81 These evolutions are structured into rounds in §5.7[ technical lemmas 
about rounds appear in §5.81 and the final work to obtain the contradiction is in §5.91 
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5.3 Ranks: definitions and useful facts 



5.3.1 Motivation 

As in many previous works on heaps and trees, the notion of the rank of a node in the heap is 
vital. The rank of a node is meant to be a rough proxy for the logarithm of the size of the subtree 
of the node. While the basic analysis of pairing heaps and splay trees [FSST861 IST85] use exactly 
this as the rank, the definition of rank here is more delicate and is an extension of that used in 
[Fre99bj . As in |Fre99bj . rank here is always a nonnegative integer. In previous definitions of rank, 
the value typically depended only on the current structure of the heap (One exception to this has 
been in order to get better bounds on Insert, nodes are treated differently for potential purposes 
depending on whether or not they will ever be deleted. See [SV871 HaclH HacOO] ). Here, however, 
the definition is more nuanced in that for the purposes of the analysis only, nodes are classified 
into marked and unmarked categories based on the history of the structure, and these marks, along 
with the current structure of the heap, are used to compute the rank of each node. 

5.3.2 Definition 

For ease of presentation, the rank of a node is defined in terms of the function e(n) = 2dc(n) + 1. 

The general idea is to have the rank of a node be maintained so it is the negation of the key value 
stored in the node. (Ranks will be non-negative, and we will only give nodes non-positive integer 
key values). The rank of a node can increase as the result of a pairing, and the value of a node can 
decrease as the result of a Decrease-Key. It is thus our goal to perform a Decrease-Key on 
a node which has had its rank increase to restore it to the negation of its rank. During the time 
between when a rank increase occurs in a node and the time the Decrease-Key is performed, we 
refer to the node as marked. 

Call the unmarked subtree of a node to be the subtree of a node if all marked nodes were 
detached from their parents; the unmarked structure of the heap is the structure of the unmarked 
subtrees of the roots. The rank of a node at a given time will be defined to be a function of the 
structure of its unmarked subtree. We emphasize that the notion of marking a node is for the 
purposes of the analysis only, such marks need not be stored. 

The following assumes a particular heap structure and marking, as the rank of a node is always 
defined with respect to the structure of the heap after executing a sequence of heap operations. 

Let x be the node we wish to compute the rank of. Let k denote the number of unmarked 
children of x, and let yi,U2, ■ ■ ■ Vk denote these children numbered right-to-left (i.e., in the order 
which they became children of x). 

Let Ti(x) be a subtree of x consisting of x connected to only the subtrees induced by yi, y2, ■ ■ ■ y%. 
We will define the function Viix) as a function of Tj(x). The rank of a node, r(x) is r^ix). 

Each node yi may be labeled as efficiently linked to its parent. If rj_i(x)— e(n) < r{yi) < rj_i(x), 
then the yi is said to be efficiently linked to x. The case of r(yi) > rj_i(x) will never occur, as 
pairings will only happen among unmarked nodes, where the rank perfectly matches the negation 
of the key value. 

We will have the property that if ri(x) is either rj_i(x) or n-i(x) + 1; In the latter case, j/, is 
called incremental. 

Given a node yi, let j be defined to be the index of the first incremental node in the sequence 
(yi—l, y%—2% ■ - •}; j is defined to be if there is no such nonincremental node. The set N{yi) is 
defined to be {yt\j < k < i}; that is, yi and the maximal set of its non-incremental siblings to the 
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right. 

Given these preliminaries, we can now give the full definition of the rank of a node that has 
defined rank: 



f if i = 



ri-i(x) + 1 

— or 



(Efficient case) t)i is efficiently linked and is the e(n)th efficient 
element of N(yi) 



(Default case) \N( yi ) \ = 2 e ^ 
ri^i(x) otherwise 

While the rank and mark are interrelated, there is no circularity in their definitions — whether 
a node is marked depends on its rank and key value and the rank of a node is a function of the 
ranks and marks of its children. 

Lemma 2 (Structural property of rank). Given two nodes x and y with different ranks, the un- 
marked structure of their induced subtrees must be different. 

Proof. Follows directly from the fact that the rank of a node is a function of its induced unmarked 
subtree. □ 

Lemma 3. The size of a unmarked subtree induced by a node of rank k is at most e(n) k 

Proof. Let Sk be the size of the maximum unmarked heap of rank k. Such a heap can be created 
from a maximal unmarked heap of rank k — 1, which has been paired to e(n) — 1 maximal unmarked 
heaps of rank k — 1 and 2 e ( n ) — (e(n) — 1) maximal unmarked heaps of rank k — e(n) — 1. Thus: 

Sk < (e(n) - l)s fe _i + 2 e(n) s fe _ e(n )_i. 

By induction, 

s k < (e(n) - l)e(n) k - 1 + 2 e ^e(n) k ~ e ^- 1 = e(n) k + e(n) k -\2 e ^e(n)- e ^ - 1). 

For sufficiently large < 1 (recall that e(n) = 2dc(n) + 1 and dc(n) = w(l)), 

thus 

e{n) k + e{n) k - 1 {2 e{n ^e(n)- e ^ - 1) < e(n) fc 
which completes the lemma. 

□ 

Corollary 4. // there are m nodes in a node x 's unmarked induced subtree, the rank of x is at 
least log e ( n ) m. 

Lemma 5 (Number of efficiently linked children). Suppose node v has rank > k and at most |2 e ( n ) 
unmarked children. Then, v has at least k/2 efficiently linked unmarked children, each having rank 
< k. 
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Proof. If there were less than k/2 efficiently linked children, than at least k/2 rank increments 
would be caused by the default case of the definition of rank and thus there would be at least ^2 e ^ 
unmarked children, a contradiction. □ 

Corollary 6. Consider a root with umarked subtree size m. The root has rank > log e ( n \ m by 

Corollary^ Suppose it has < loge W m 2 e ( n ) unmarked children. Then it has > | log e( - n ) m efficiently 
linked unmarked children. 

Observe that: 



L2_ 2 e W = . l ogm 

2 loge(n) 



and 



1 

2~ &e W"'- 21oge(n) 



- log e(n) m = 77T7 - — • log m 



2 e ^- 1 „„j „/^\ _ i 



We now use these observations to restate Corollary [6l Set f(n) = log e ^ and g{n) — 2 \ ge(n) • 
Then: 

Corollary 7. Suppose a root with unmarked subtree of size m has < /(n)logm children. Then it 
has > g{n)\ogm efficiently linked children. 

We note that f(n) = = log(2dc(n)+1) = o(logn) since dc(n) = o(loglogn). 

5.4 Monotonic operation sequences 

Call the designated minimum root the next node to be removed in an Extract- Min. 

Definition 8 (Monotonic operation sequence). In a monotonic operation sequence, Decrease-Key 
operations are only performed on roots, children of the designated minimum root, or marked nodes. 

Observation 9 (Monotonic sequences and structure). In a monotonic operation sequence, for 
any node x with descendent y where all nodes on the path from x 's child up to and including y 
are unmarked, y will remain in the same location in x 's subtree until x becomes the designated 
minimum root. 

Observation 10 (Monotonic sequences and rank). In a monotonic operation sequence, the rank 
of a node never decreases, from the time it is inserted until the time it becomes the designated 
minimum root. 

All of the sequences of operations we define will be monotonic. 
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5.5 Augmented suboperation 

We augment suboperation [6l the Pair(-) operation, to return whether or not the rank was incre- 
mented as a result of the pairing. This augmentation does not give any more power to the pure 
heap model, since the exact ranks of all nodes is a function of the suboperation sequence and is 
thus known to the algorithm already. We use this augmentation to create a finer notion of what 
constitutes a distinct sequence of suboperations. In particular, we will use the following fact: 

Lemma 11. Suppose Sj and Sj are two structurally distinct states of the data structure. Suppose a 
single valid sequence of sub-operations implementing an Extract-Min is performed on both, and 
the outcomes of all augmented sub- operations that have return values are identical in both structures. 
Then, the position of all nodes who have had their ranks changed is identical in both. 

Proof. The only time a node can have it's rank change is when something is paired to it, and this 
is now explicitly part of the return value of operation [6J □ 

5.6 Evolutions of indistinguishable sequences 
5.6.1 Definitions 

Let B = 62, . . .) be a sequence of priority queue operations. 

Let A(bi) = (a\,a l 2, ■ ■ ■) be the sequence of augmented suboperations and their return values 
used by algorithm A to execute operation bi if bi is an Extract-Min; if it is not A{bi) is defined 
to be the empty sequence. A(B) is the concatenation of A(bi), Afo), • • •• 

We call two sequences of priority queue operations B and B' algorithmically indistinguishable 
if A(B) = A(B'), else they are algorithmically distinct. 

Let SB(i) be the structure of the heap after running sequence (61 . . . bi); the terminal structure 
of B is sb(\B\) which we denote as sb- Recall that by structure, we mean the raw shape of the heap 
without regard to the data in each node, but including which nodes are marked. Two sequences 
B and B' are terminal- structure indistinguishable if sb = sb', else they are terminal- structure 
distinct. 

Note that having two sequences which are algorithmically indistinguishable does not imply 
anything about them being terminal-structure indistinguishable. For example, it may be possible 
to add a Decrease-Key to a sequence, changing the terminal structure, while the sequence of 
sub-operations performed to execute the sequence remains unchanged. (Recall that sub-operations 
only occur during Extract-Min operations). 

Given a set of mutually algorithmically indistinguishable and terminal-structurally distinct (AI- 
TSD) sequences of heap operations 3, the distinctness of the set, £(3) is defined to be log |3|. 

Lemma 12. The maximum distinctness of any set 3 of terminal-structurally distinct sequences, 
all of which have terminal structures of size n, is £(3) = 0{n). 

Proof. The number of different shapes of a rooted ordered forest with n nodes is C n , the n th Catalan 
number. The number of different ways to mark some nodes in a forest with n nodes is 2™. Since 
Cn < 4 n , the maximum number of distinct structures is at most log(2 n 4 n ) = 0{n). □ 
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5.6.2 Evolving 

We will now describe several functions on AI-TSD sets of heap operations; we call such functions 
evolutions. The general idea is to append individual heap operations or small sequences of heap 
operations to all sequences in the input set H and remove some of the resulting sequences so as to 
maintain the property that the sequences in the resultant set of sequences E' are algorithmically 
indistinguishable yet terminal-structure distinct. The evolutions will also have the property that if 
the time to execute all sequences in E is identical, then the runtime to execute all sequences in E' 
will also be identical. The difference in the runtime to execute sequences in E' versus those in S 
will be called the runtime of a evolution. 

5.6.3 Insert evolution 

The insert evolution has the following form: S' = Evolve-Insert(E). 

In an insert evolution, a single Insert operation of a key with value is appended to the end 
of all E to obtain E'. Given E is AI-TDS, the set E' is AI-TDS and trivially £(S) = f(H'). The 
runtime of the evolution is 1 since the added Insert has runtime 1. The rank of the newly inserted 
node is 0, and is thus unmarked. 

5.6.4 Decrease-key evolution 

The decrease-key evolution has the following form: E' = Evolve-Decrease-Key(S,p), where p 
is a location which is either a root or a marked node in all terminal structures of sequences in E. 

In a Decrease-Key evolution, a Decrease-Key(p, Ax) operation is appended to the end 
of all sequences in E to obtain E'. The value of Ax is chosen such that the new key value of 
what p points to is set to is the negation of its current rank; this means Ax is always nonnegative 
because of the monotone property of ranks noted in Observation [TUJ Observe that if p points to a 
marked node, then it is unmarked after performing a Evolve-Decrease-Key. This requirement 
ensures that since all structures that are distinct before this operation will remain distinct after 
the operation. Thus, the set E' is AI-TDS and trivially £(E) = £(E'). The runtime of the evolution 
is 1 since the added Decrease-Key has runtime 1. 

5.6.5 Designated minimum root evolution 

The designated minimum root evolution has the form E' = Evolve-Designated-Minimum-Root(S, r), 
where r is the position of one root which exists in all terminal structures of E. 

In a designated minimum root evolution, a Decrease-Key operation on r to a value of negative 
infinity is appended to all sequences in E to give E'. It will always be the case that the (next) 
evolution performed on E' will be an Evolve-Extract-Min evolution; the root r, which is known 
as the designated minimum root, will be removed from all terminal structures of E' in this subsequent 
Evolve- Extract-Min. There is no change in distinctness caused by this operation: £(E) = £(S'). 
The runtime of the evolution is 1 since the added Decrease-Key has runtime 1. 

5.6.6 Extract-min evolution 

The extract-min evolution has the form (E',V,em) = Evolve-Extract-Min(E). 



14 



The Evolve-Extract-Min evolution is more complex than those evolutions previously de- 
scribed, and the derivation of 3' from 3 is done in several steps. First, a Extract-Min operation 
is appended to the end of all sequences in 3 to obtain an intermediate set of sequences which we 
call S. Recall that the Extract-Min operation is implemented by a number of sub-operations. 
There is no reason to assume that the sub-operations executed by the algorithm in response to 
the Extract-Min in each of 3 are the same; thus the set 3 may no longer be algorithmically 
indistinguishable. We fix this by removing selected sequences from the set 3 so that the only ones 
that remain execute the appended Extract-Min by using identical sequences of supoperations. 
This is done by looking at the first suboperation executed in implementation of Extract-Min 
in each element of 3, seeing which sub-operation is the most common, and removing all those 
sequences 3 that do not use the most common first sub-operation. If the sub-operation is one 
which has a return value, the return value which is most common is selected and the remaining 
sequences are removed. This process is repeated for the second sub-operation, etc., until the most 
common operation is End() and thus the end of all remaining sub-operation sequences has been 
simultaneously reached. Since there are only a constant r/ number of sub-operations, and return 
values, if present, are boolean, at most a constant fraction (specifically ^) of 3 is removed while 
pruning each sub-operation. At the end of processing each sub-operation by pruning the number 
of sequences, new set is returned as 3'. The set 3' can be seen to be terminal-structure distinct, 
since pairing identically located roots in structurally different heaps, and having the same nodes 
win the pairings, can not make different structures the same. 

Observe that the nodes wining pairings in the execution of the Extract-Min might have their 
ranks increase, and thus become marked. Moreover, due to Lemma [TT1 the position of all such 
nodes will be identical in all terminal structures of H'. The set of the locations of these newly 
marked nodes is returned as V, the violation set. 

Now that it has been ensured that all of the sets of operations execute the appended Extract-Min 
using the same sub-operations, we define em to be this common number of sub-operations used 
to implement the Extract-Min; this value is returned by the evolution. As each sub-operation 
reduces the distinctness by at most a constant, £(H') > f(3) — emlog(2rj) = £(S) — O(em). The 
runtime of the evolution is em since that is the cost of the added Extract-Min. 



5.6.7 Big/small evolution 

The big/small evolution has the form (3', (p,bigsmall)) = Evolve-Big-Small(3). 

In a big/small evolution, each terminal structure of each of the operation sequences of 3 is 
classified according the following, according to the previously-defined function /(re): 

• The exact number of roots if less than /(re) log re or the fact the the number of roots is greater 
than /(re) log n (we call this case many-roots). 

• If the exact number of roots is less than /(n) log re: 

— The position of the root with the largest subtree (the leftmost such root if there is a tie). 
Call it p. Observe that the size of p's subtree is at least j( n " logra - 

— The exact number of children of p if less than f(n) log j( n )" ogn (we call this case small) 
or the fact that the number of roots is greater than f(n) log f( n ™\ og n ( we can this case 
root-with-many- children) . 



15 



There are at most [/(n) log re] • \f(n) log ^ n " logr J possible classifications. We create set H' by 
removing from S sequences with all but the most common classification of their terminal structures. 
The return value is based on the resultant classification: 

Many-roots: Return (p,bigsmall) where p = NULL and bigsmall = Big. 

Root-with-many-children: Return (p, bigsmall) where p is the location of the root with the 
largest subtree and bigsmall = Big. 

Small: Return (p, bigsmall) where p is the location of the root with the largest subtree and and 
bigsmall = Small. 

We bound the loss of distinctness, which is the logarithm of the number of classifications. 
Since f(n) = o(logn), then log (\f(n) log re] • \f(n) log /(n) " log j ) = O (log log n), and thus £(E') = 
£(S) — 0(log log n). The runtime of the evolution is since no operations are added to any sequence. 

5.6.8 Permutation evolution 

The permutation evolution has the form S' = Evolve-Permute(H), where the leftmost root r 
has in all terminal structures of the sequences of H subtree size at least j( ra )" ogn and has a number 
of children which is at most /(re) log y( n )" ogra ? this w iU be achieved by being in the small case of 
the big/small evolution and performing a decrease-key evolution on the relevant node. It is also 
required that all terminal structures of sequences in 3 are entirely unmarked. 

In a permutation evolution, the goal is to increase the distinctness, and is the only evolution to 
increase the number of sequences in the process of converting E to H'. 

Combining the preconditions of the permutation evolution with corollary yields the fact that 
all nodes in the terminal structures of S at location r have at least g(n) log j( n )" ogra efficiently linked 
children; since there are at most e(n) efficiently linked children of each rank, that means there are 

g(n) log ,, 

at least e(n) ° B " efficiently linked children of different ranks, in each terminal structure find 

such a set and call it the permutable set (chose one arbitrarily if there is more than one possibility). 
Look at the position of these permutable sets in all terminal structures of the sequences of H, 
and pick the position of the permutable set that is most common. Form the intermediate set of 
sequences H by removing from S all sequences that do not have this commonly located permutable 
set. An easy upper bound on the number of different locations permutable sets could be in 

Size of permutable set 

A Number of children of r 



f(n) log 



e(n) /(n)logn 

Thus this step decreases the distinctness of the set by at most the logarithm of the number of 
commonly located permutable sets: 



e(n) /(re) log n 

As f(n) = o(logre), g{n) = o(l), e(n) = w(l), one can simplify this to 
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£(H)-£(H) = -0(loglogn). 
The permutable set is of size ^— — ^^ n) log " . Using the definitions of f(n) and cfc(n), this is 

6 ^ d c ( n ) 1 iog"dc(ra) ) ■ Let 771 De a constant such that the permutable set is of size at least dc(n)logdc(n) 
for sufficiently large n. Recall that in all the terminal structures of the sequences of E, the ranks 
of the items in the permutable sets are different, and in fact are strictly increasing, when viewed 
right-to- left as children of r. 

We then create H' by replacing each sequence in 3 with ( dc(n)\ogdc(n) ) • new sec i uen ces cre- 
ated by appending onto the end of each existing sequence a sequence of all possible permutations 
of Decrease-Key operations on all elements of an arbitrary subset of size mlogn Q £ ^ 

L J dc(n) log dc(n) 

permutable set. 

The fact that all of sequences in 3 have the same permutable sets ensures that all terminal 
structures in 3' are terminal-structure distinct. (Recall that Lemma [2] says that different ranks 
imply different structures of induced subtrees). Thus, in this step distinctness increases by £(S') — 

c(i:\ — l na . ( mlogn \ \ _ r\ ( log n log log n \ 
<i\^J iu & \dc(n) log dc(n) J ■ ^ \dc{n) log dc(n) J ' 

Thus, combining all the steps of the permutation evolution bounds the total increase of distinct- 
ness by 



£(S') - £(S) = £(3') - £(3) + f(S 
/ log n log log n 
\dc(n) log dc(n) 
/ log n log log n 
\dc(n) log dc(n) 

The cost of the evolution is at most mlogn since that is the number of Decrease-Key 

ac(n) log dc(n) ' 

operations appended to the sequences, and these all have unit cost. 
5.7 Rounds 

We proceed to perform a sequence of evolutions ^ = (ipo,ipi,ip2, ■ ■ ■} to define a sequence of AI-TSD 
sets (3o, Hi, 32, 33, . . .). The initial set 3o consists of a single sequence of operations: the operation 
Insert(O), executed n times. Each subsequent AI-TSD set 3j is derived from 3j_i by performing 
the single evolution ipi-i; thus in general 3j is composed of some of the sequences of 3j_i with 
some operations appended. 

These evolutions are split into rounds; Oj is the index of the first AI-TSD set of the ith round. 
Thus round i begins with AI-TSD set 3 Di and ends with 5 0i+1 _i through the use of evolutions 
{if) 0i . . . l/fo i+1 -l) 

These rounds are constructed to maintain several invariants: 

• All terminal structures of all sequences in the AI-TSD set at the beginning and end of each 
round have size n. This holds as in each round, exactly one Insert evolution and exactly 
one Extract-Min evolution is performed. 



m 

- 0(log log n) 
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Algorithm 1 Algorithmic presentation of how evolutions are used to build the sequence of AI-TSD 
sequences (Ho, Si, S2, . . .), which are split into rounds where the index of the start of round i is o i . 

n Insert operations 
, " , 

S = {(Insert(O), Insert(O), . . . , Insert(O))} 
i = 

round = 
o = 0; 
loop 

(Ei,p,bigsmall) = EvOLVE-BiG-SMALL(S i++ ); 
if bigsmall = Small then 

Sj = EvOLVE-DECREASE-KEY(S i++ ,£>); > Small round 

3j = Evolve-Permute(Sj ++ ); 
else 

Sj = EvOLVE-DESlGNATED-MlNIMUM-RoOT(Sj ++ ,p); > Big round 

end if 

(Sj, V, em) = EvOLVE-EXTRACT-MiN(Sj ++ ); > Common to small and big rounds 

for each v in V do 

Sj = EvOLVE-DECREASE-KEY(Sj_|__j_, v); 
end for 

Sj = EvOLVE-lNSERT(Sj ++ ); 
°++round — 

end loop 



• All nodes in all terminal structures in the AI-TSD sets at the beginning and end of each 
round are unmarked. 

There are two types of rounds, big rounds and small rounds. At the beginning of both types of 
round a big/small evolution is performed. The return value of the big/small evolution determines 
whether it will be a big or a small round. 

Please refer to Algorithm 1 for a concise presentation of how evolutions are used to construct 
(So, Si, S2, S3, . . .}. We now describe this precess in detail. 

5.7.1 The Big Round 

As the round begins, we know that the terminal structures of the AI-TSD set are entirely unmarked, 
and there are either at least /(n)logn roots, or one root with at least f(n) log ^ n ^\ ogn children. 
The round proceeds as follows: 

1. Perform an designated minimum root evolution on the root with largest subtree; this is the 
node r from the big-small evolution whose location is encoded in the return value; as a result 
of the big-small evolution it is guaranteed to be in the same location in all of the terminal 
structures of the sequences of S. 

2. Perform a Extract-Min evolution. 

3. For each item in the violation sequence returned by the Extract-Min evolution, perform a 
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Decrease-Key evolution. This makes the terminal structures of all heaps in S mark-free. 
4. Perform an Insert evolution. 



Assuming we are in round i, Let em, be the cost of the Extract-Min evolution, and let Vi be 
the size of the violation sequence. The cost of the round (the sum of the costs of the evolutions) 
is errii + V{ + 2, which is at least f(n) log ^ n ^\ ogn i an d based on the evolutions performed the 
distinctness can be bounded as follows: £j — is 0(log(emj + v , + 1)). As vi < errij, the loss of 
distinctness is O(logemj). 



5.7.2 The Small Round 

In a small round, there is one root, call it x, at the same location in all terminal structures, with size 
at least t , -P, and some identical number of children in all terminal structures which is at most 

/(n) logn 

f(n) log j( ra )" ogn • The location of x was returned by the big-small evolution. The round proceeds 
as follows: 



1. Perform an Decrease-Key evolution on x to make it negative infinity. 

2. Perform a permutation evolution. 

3. Perform an Extract-Min evolution. 

4. For each item in the violation sequence returned by the Extract-Min evolution, perform a 
Decrease-Key evolution. 

5. Perform an Insert evolution. 



Let errii be the actual cost of the Extract-Min, let Vi be the size of the violation sequence. 
The cost of the round is errii + Vi + 2 + dc(n)iogdc(n) ' anc ^ based on the evolutions performed 



the distinctness can be bounded as follows: £j — is 0(log(em,j + fj + 1)) — O ^ J"^" |°| 
0(logem t )-^( i°f n " loslosw ' 



dc(n) log dc(n) 

5.8 Upper bound on time 

Lemma 13. The total time to execute any sequence in H Dfe is 0{k\ogn). 

Proof. Let B be a sequence in H Dfe , and let R(B) be the time to execute B. 

Let dci be the cost of the permutation evolution in round i; this is at most dc{n)\ogdc{n) ^ 
round i is a small round and if round i is a big round (recall that permutation evolutions are 
only performed in the small round). Thus the cost for any round i, whether big ( §5. 7. ID or small 
( §5.7.2 j) . can be expressed as 2 + dci + em; + vi. The time to execute any sequence B € H Dfe , which 
we denote as R(B), is the sum of the costs of the rounds: 

k 

R(B) = Y,( 2 + dci + emi + Vi) (1) 

i=l 

An item in the violation sequence has had its rank increase. Its rank can only increase after 
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winning e(n) pairings. Pairings are operations. Thus, 



J> < « (2) 

z — ' en 
Combining ([I]) and ([2]) and rearranging gives: 

EVi < — ^— ^(2 + da + errn + «i) 
e(n) ^— ' 
i=i v ' i=i 

/ 1 \ fe 1 fc 

V u< < ? r - V(2 + + em*) (3) 
z — ' e(n) — 1 z — ' 
i=i v ' i=i 

We know by assumption that the runtime is bounded by the sum of the amortized costs stated 
at the beginning of E j5.ll Each round has one Insert and one Extract-Min (at a amortized cost 
of clogra each) and 1 + tfcj + Vi Decrease-Key operations (at a amortized cost of dc(n) each). 
This gives: 

fc 

R(B) < ^((cfc, + Vi + l)cfc(n) + 2c log n) (4) 

i=l 



Combining ([3|) and (jH gives: 

fc 



fl(B) < £ 



i=l 



2 + + emi 

« c i "I ; 1" 1 acfnj + 2c log n 

e(n) — 1 



Since da < M n)} tg2:(n) and dc(n)loldc(n) ^ S% for sufficiently large n (recall that m is a constant 
defined in ggjrgp . 



i=l 

i?(5) < log n 



e(n) — 1 
2 



+ 1 dc(n) + 



erriidc(n) 
e(n) — 1 



+ 1 + 2c + 



e(n) — 1 



log n 



e(n) — 1 



+ 1 



dc(n) 
log n 



+ 1 + 2c + 



e(n) — 1 



dc(re) 

+ -7-^ r > errn 

e{n) 



i=l 



Since X2t=i em « — ^(-^0 



< fclogn 



e(n) — 1 



+ 1 



dc(n) 
log n 



+ 1 + 2c + 



e(n) — 1 



e(n) — 1 
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R{B) 



( 1 dc(n) \ 
V e(n)-l) 



< klogn 



+ 1|^ + (1 + 2 C + 



e(n) — 1 J logn 
Substituting in the definition of e(n): e(n) = 2dc(n) + 1 



e(n) — 1 



1 \ dcin) ( 1 

+ 1 + 1 + 2c + 



dcin 



log n 



2dc(n) 



2 dcin) 1 
+ + 1 + 2c + 



log n log n 



2dc{n) 



R(B) < 2Hogn 

For large enough n, + ^ + ^ < 1, so 

< 2(2c + 2)A;logn 

Since c is a constant, this concludes the proof. 
Lemma 14. After k rounds, at least | rounds must be small rounds. 



□ 



Proof is by contradiction. Suppose more than ^ rounds are big rounds. The actual cost of a 
big round is at least f(n) log y( n )" og n ? s ° the total actual cost is greater than: 



R{B)> k J^\o g 



f{n) logn 



2 e(n)-i ?iloge(n) 

K— — log ■ 



21oge(n) G 2 e W- 1 logn 

2 2dc(n) n log(2efc(n) + 1) 

fe— /„ , / x — lOg ■ 



since /(n) 



2e(n) — 1 



' 2 log(2dc(n) + 1) 2 Mc M log n 

n ^ 2 2dc(n)-loglogdc(n) log ^ 



log e(n) 
since e(n) = 2dc(n) + 1 



Since dc{n) = uj(1), this contradicts Lemma fT3| that the runtime is 0{k\ogn). 
5.9 Putting it together 

We have bounded the gain of distinctness of a round depending on whether it is big or small. 



-0(emi + Vi) 



If the ith round is a big round (£ )5.7.1|) 



_ -0(e mi + Vl ) + Q ( j°f n "gg) j If the ith round is a small round (gffi]) 
Now we know that X^i=i( em « + ws «) ^ s ^ ess than the actual cost to execute a sequence in 



(5) 



which is O(fclogn) by Lemma [T3l Substituting X^i=i( em « + vs i) = O(klogn) into © gives, and 
using the fact from Lemma Q3] that at least half of the rounds are small rounds: 



^-go =e (fc ]ogn ^^ (w) )-Q( fc to g n) 



(6) 
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Since dc(n) = o ( VgTogfogn ) » dc{n)\o^dc(n) = "W- and thus the negative term in © can be 
absorbed, giving: 

_ = n / HognloglogrA 
^° fc v dc ( n ) lo § dc ( n ) / 

But after sufficiently many rounds (i.e. sufficiently large k) this contradicts Lemma [T2] that 
for all i, £i = 0(n). Thus for sufficiently large k and n a contraction has been obtained, proving 
Theorem (TJ 



6 The sort heap 

6.1 Purpose 

All heaps data structures that support (9(loglog?7,) Decrease-Key require either augmented data 
(e.g. Fibonacci heaps and variants), or an implementation of Decrease-Key that places them 
outside of both our and Fredman's |Fre99bj models for lower bounds. In this section we describe 
a new and simple structure, which we call the sort heap, which has no augmented data and for 
which our lower bound (but not Fredman's) applies. Our structure features O(loglogn) Insert 
and Decrease-Key, and O(lognloglogn) Extract-Min. When the ratio of Decrease-Key 
operations to Extract-Min operations is f2(loglogn), Decrease-Key is the dominant operation 
in the amortized runtime. 

Thus the sort heap is the first self adjusting heap with o(2 2l ° slogn ) Decrease-Key in a 
model with lower bounds for Decrease-Key; in our model the O(loglogn) amortized runtime for 

Decrease-Key is within a 0(log log log n) factor of the Q ( fogk^iogn ) amor ti ze d lower bound. The 
sort heap only differs from the pairing heap in the choice of pairings performed in the Extract-Min 
operation. 

The potential function used to analyze the sort heap is very different from that used to ana- 
lyze Fibonacci heaps, pairing heaps, and their variants. The potential of a node is in the range 
. . . G (log log n), which is a much smaller range than that used in all other potential functions. 
(A smaller range of a potential function indicates that the analysis can be applied to smaller 
sequences. The size of a pontential function's range plays a role in some deamortization transfor- 
mations; see e.g. [BCFL12] ). Secondly, and perhaps more interestingly, the potential function is 
not logarithmic in nature. Previous functions were dominated by a logarithms of some function 
of the subtree size of the node and perhaps the parent of the node. Our potential function has a 
dependence on subtree size, but it is linear. It is inspired by the potential function used to analyze 
skew heaps [ST85 j , a heap that predates all heaps with o(log n) Decrease-Key. 

6.2 Implementation of operations 

The data structure is a min-heap-ordered general tree. 

p =Insert(x): Create a new node with key x and adds it as a new leftmost heap in the forest of 
heaps. Returns a pointer p to the new node. 

Decrease-Key(p, Ax): Detaches the node pointed to by p from its parent (if it is not a root), 
decreases the key value by the nonnegative value Ax, and places it as the root of the leftmost 
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heap in the forest of heaps. 



x =Extract-Min(): Use a linear search to find the root with minimum value. Remove this root 
and return its key value, and place its children (if any) as the leftmost roots of the forest. 
Let k be the total number of roots in the tree. These k roots now must be combined using 
the pairing operation. These k roots are grouped into groups of logn nodes; the last 

group might not be full. Within each group, the key values in the roots are sorted and then 
paired in order from largest to smallest. This leaves one heap per group; these heaps are 
combined via pairings of their roots arbitrarily until a single heap remains. 

Comparison to the Fibonacci heap. The primary difference between the sort heap and the 
Fibonacci heap (and the pointer Fibonacci heap described on pageUJ) is that in the implementation 
of Extract-Min, Fibonacci heaps sort according to subtree size and not according to key value. 
In the Fibonacci heap, a rough approximation of logarithm of the subtree size is stored explicitly 
as an augmented field (called the rank, whose definition is not the same as the use of the word 
rank in this work). Fibonacci heaps also have the notion of marked nodes and cascading cuts; these 
notions add complexity to the implementation of the Decrease-Key operation in some cases. 

Comparison to the pairing heap. The only difference between the sort heap and the pairing 
heap is the choice of pairings performed in Extract-Min. Otherwise, they are identical. 

Comparison to Elmasry's heap. The only difference between the sort heap and Elmasry's heap 
is the implementation of Decrease-Key. Elmasry's heap has an implementation of Decrease-Key 
where the leftmost child of the node having its key decreased is removed and is attached in the 
place of its former parent. The sort heap does not have this leave-one-child-behind behavior; all 
children of the node being decrease-keyed remain attached. 

So, the sort heap takes the idea of Elmasry that sorting on keys is a good substitute for 
the Fibonacci heap's sorting on subtree sizes, and uses the pairing heap's implementation of 
Decrease-Key, which is the simplest. 

6.3 Potential 

The potential method is used to bound the amortized runtimes of the operations of the sort heap. 
6.3.1 Preliminaries 

Let S(x) be the number of nodes in the induced subtree of x and its right siblings. Let L{x) be 
number of nodes in the induced subtree of x, and let R(x) be the number of nodes in the induced 
subtrees of the right siblings of x. These definitions are made so that S(x) = L(x) + R(x). 

We call a node right heavy if R(x) > ^S(x) and left heavy if L(x) > ^S(x). A node that is 
neither left heavy nor right heavy is called transitional; transitional nodes have L(x) and R{x) in 
the range (^S(x)..^S(x)). 
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6.3.2 Potential function 



Given these definitions, we can now define the potential of a node. For ease of presentation, we 
will first define the raw potential of a node C( x )i an d the define the potential of a node, ip(x), to be 
C(x)clog log n, where c is a sufficiently large constant chosen, where sufficiently large will be defined 
in the proof of Lemma [22j This allows us to avoid the clutter of c log log n terms when presenting 
some preliminary lemmas. 

Right heavy nodes have raw potential 1, and left heavy nodes have raw potential 0. Transitional 
nodes have their raw potential linearly vary from to 1, specifically, if x is transitional 

Rjx) - \S{x) 
C(x) = 1 2 . 

For example, the raw potential is \ when R(x) = L(x) = ^S(x). 

Observe that the potential of a node varies from to clog log n. This is a smaller range than 
that used to analyze other pairing-heap type structures. The original analysis of pairing heaps 
[FSST86| and Elmasry heaps lElm09| vary up to logn, while Pettie's analysis of pairing heaps can 
vary by 2 n( - lo ^ l °^ |Pet05j . 

The potential of a heap is the sum of the potentials of the nodes of the heap. 



6.3.3 Properties of the raw potential 

In this section several facts about the raw potential are presented that form the backbone of the 
analysis of the structure. 

Observation 15. A single pairing only changes the raw potential of the two nodes directly involved 
in the pairing; their raw potentials can only change by one. 

Lemma 16. In a heap of size n, the sum of raw potentials of k nodes which are mutual siblings is 
at least k — log 3 n. 

2 

Proof. In order for a node to not have raw potential 1, it must be left heavy or transitional. 
We will bound the number of such nodes. Label the mutual siblings which are left heavy or 
transitional xi,X2, ■ ■ ■ ,X£ in left-to-right order. Thus, R(x{) < 2S )® 1 ' • combining this with the fact 
that R(xi) > S(xi + i) gives S(xi) > ^S(xi+\). Since S(x\) < n and S(xk) > 1, this gives £ < log3 n, 
which gives the result. □ 

Lemma 17. In a heap of size n, the sum of the raw potentials of any set S of nodes which are 
mutual ancestors/ descendants (i.e. a subset of a root-to-leaf path) is at most log3 n. 

2 

Proof. The proof is similar to that of the previous lemma, with the minor need to account for the 
asymmetry of a node itself counting in L(x) and not in R(x). Let x\,X2, ■ ■ ■ ,xe be those nodes 
in S which have nonzero raw potential, ordered such that each node is a descendant of those that 
precede it and an ancestor or those that follow it. Since each node X{ has non-zero raw potential, 
it must be either right-heavy or transitional, which gives L{xj) < 2S ^^ . Since S(xi + \) < L(x,i) — 1, 
we obtain |5(xj) > S(xi + i). Since S(x±) < n and S(xj t ) > 1, this gives t < log3 n, which gives the 
result. □ 
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Lemma 18. Removing a node and its subtree from a heap increases the sum of the raw potential 
of the remaining nodes of the heap by at most 13. 

Proof The only nodes that can have their raw potential increase are the (former) ancestors of the 
disconnected node. The siblings to the left of the (former) ancestors of the disconnected node could 
also have their potential change, but this change will always be negative. Let x±, X2, ■ ■ ■ , x# denote 
these nodes which have their raw potential increase, ordered such that each node is a descendant 
of those that precede it and an ancestor or those that follow it. 

All nodes Xi will have L(x{) decrease by the size of the removed subtree (call it M) while R{xi) 
will remain the same. We will use L(), R(), SQ and ip() to refer to the values before detaching the 
node, and L'Q,R'Q,S'() and ip'Q to be defined as a function of the state of the structure after 
detaching the node. The only way these changes can cause an increase in Xj's raw potential is when 
Xi is left heavy or transitional before the node is disconnected; if it is right heavy, it will remain 
right heavy with unchanged unit raw potential. For the same reasons as in Lemma [T71 we have 
S{x{) > ^S(xi+i). This in turn implies that 

S{xi) > (J\ S(x e ), 

which since S(xi) > M gives 

5(^)>QVm. (7) 

Consider the case where Xi change potential. Since it was transitional or left heavy before, and 
becomes right heavy or transitional after, we know that its increase in raw potential is: 



potential after 



potential before 



C'(xi) - ((xi) = min 



/ right 
heavy 



V 



transitional 



R'(xi) - \S'( Xi ) 



\ 



^S'(xi) 



max 



/ 



/ left 
' heavy 



V 



transitional 



\ 



R(xi) - hS(xi) 



o S{xi) 



J 



The use of min and max works because the formula for a transitional node would be at most 
when applied to a left heavy node and at least 1 when applied to a right heavy node. Removing 
the min and max can only increase the gain: 



< 



R'(xi) - \S'{xi) R{ Xi ) - \S{ Xi ) 



^S'(xi) 



■zS^Xi 



Using the fact that S'(xi) = S{x.i) - M: 

_ R(xi) - l(S{ Xi ) - M) R{ Xi ) - \S{xi) 



US{ Xi )-M) 
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qinr-P R { Xi ) <r 1 
&mCe SlxJ < 1 

< S(xi) - M 

Using (0): 

3M 

~ (§) i-i M-Af 

= ,n^_i ( 8 ) 

ID "I 

Thus we can bound the sum of the raw potential increase of the nodes X\,X2, ■ ■ ■ ,X£—i using 
equation flS} as follows: 

E(c^) - c(x,)) < E * <i2 

i=l t=l I2J ~~ 1 

Using Mathematica, a closed form solution to the limit of this sum was obtained that has an 
upper bound of 12. The node xg was excluded from the above sum, as this would cause a divide- 
by-zero. However, any single node can only change by 1, so including xg will give an upper bound 
of 13. 

As these are the only nodes that can change potential, this completes the lemma. 

□ 

6.3.4 Potential impact resulting from the growth of the structure 

One fact will be of use, is to show that as n changes this does not have a large effect on the potential 
of the nodes, even though all nodes are multiplied by log log n. 

Fact 19. Ifn>3, then n log log n — nloglog(n — 1) < 2 

6.4 Analysis 
6.4.1 Insert(x) 

Lemma 20. The amortized cost of Insert in a sort heap is O(loglogn). 

Proof. The actual cost is 1 as a single pairing is performed. View the change of potential in two 
steps: the first is adjusting the log log n multiplier of the potential of all the nodes caused by the 
incrementing of n. Fact [19] tells us that this increases the potential by at most 0(1). Then look 
at the potential of the new inserted node; this is O(loglogn) since the raw potential of a node is 
at most 1 by observation [15j The only other change possible is in the potential of the former root 
of the heap, again this is trivially bounded by O(loglogn) since this is the maximum attainable 
potential. Thus combining the actual cost plus the change in potential gives the O(loglogn) 
amortized cost. □ 
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6.4.2 Decrease-Key(p, 5x) 

Lemma 21. The amortized cost of Decrease-Key in a sort heap is O(loglogn) 

Proof. The actual cost is 1. We analyze the potential in two steps. Let y be the node pointed to 
by p. The first step is to detach y from its parent and decrease its key. This could cause changes in 
the potential in the (former) ancestors of y. Using Lemma [T8| the increase in potential is bounded 
to be at most 12 log log n. The second step is to pair y with the root of the tree. This can change 
the potentials of only y and the root of the tree, giving an easy bound on the potential gain of 
2 log log n. Thus combining the actual cost plus the gain in potential gives an amortized cost of at 
most 1 + 12 log log n = 0(log log n). □ 

6.4.3 Extract-Min 

Lemma 22. The amortized cost of Extract-Min in a sort heap is O(lognloglogn). 

Proof. Let £ = be the number of blocks which are sorted as part of the Extract-Min; 

they all have size exactly logro except for possibly the last one. The actual cost is therefore 
O(^lognloglogn), as it costs O (log n log log n) to sort each block, and the sorting dominates the 
actual cost. The key observation is that all nodes in a block were mutual siblings, and because 
they are being paired in sorted order, they form a vertical ancestor/descendent chain when the 
Extract-Min is complete. Thus, by Lemma [TBI in each full block the potentials before were at 

least c log n log log n — clog 3 n log log n; for the nonfull block we simply bound the potentials as 

2 

being at least 0. By Lemma [171 in each block the potential after sorting and pairing is at most 

clog 3 n log log n. Thus the amortized cost per block is given by the sum of the actual costs and 

2 

change in potential; this is at most: 

actual cost new potential old potential 



O(^lognloglogra) + ^clog3 n log log n — {£ — l)(clognloglogn — clog3 n log log n) 

2 2 

= Oil log n log log n) + 2£c log 3 n log log n — Ic log n log log n + c log 3 n log log n 



2 



= 0(£ log n log log n) + ( — ^ — c ] £ log n log log n + clog3 n log log n (9) 

^log I J 2 

As we noted in the definition of the potential, the choice of the constant c has been deferred until 



this point. The constant c is chosen to be large enough so that the (^j^t — c j m the second term 
is sufficiently large so as to cancel the big-0 expression, thus giving the amortized cost as: 

< c log 3 n log log n. 

2 

□ 

7 Comments 

Both the proof presented here and the proof of Fredman share many similar aspects. Fredman's 
restriction that comparisons can not be performed without a pairing allowed him to observe a 
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property he called consistency. This is the notion that he did not have to worry about the key 
values themselves, since their ranks were a perfect proxy for key values. Much of our effort has been 
spent to get a result without the consistency property. To do this, we have made some changes to 
the rank function compared to Fredman so that the rank of a node changes more slowly; we attempt 
to keep a node's rank in sync with its key value, but this is not completely possible. However, we 
show that through the use of the violation list and additional Decrease-Key operations to re-key 
items whose rank and key value no longer correlate, something like consistency can be managed. 
The ideas of evolution and having the adversary maintain sets of sequences are new. 

Looking forward, there are still unanswered questions and loose ends for possible future work: 

• We believe that in our lower bound of Q ( lo g^g^g ra ) for Decrease-Key, that the log log log n 
should be able to be removed. 

• In our definition of the pure-pointer model of heaps, we do not allow the algorithm to detach 
a node from its parent unless a Decrease-Key is performed on it or if its parent is being 
removed as part of a Extract-Min operation. The lower bound should be able to be 
extended to allow such operations, which would make our lower bound apply to Elmasry's 
variants of heaps. 

Similarly, our lower bound does not apply to Fibonacci heaps largely because of of the bucket 
sorting used; this is intentional. However, we described the Pointer Fibonacci heap which 
made the Fibonacci heap pointer model. However, the Pointer Fibonacci heap remains out- 
side of our pure heap model because the implementation of Decrease-Key involves more 
than just cutting the node from its parent; there is a process called cascading cuts whereby, 
in some cases, ancestors of the node being DECREASE-KEYed also are cut from their parents 
(Recall from the introduction, in contrast, rank-pairing heaps are a simplification of Fibonacci 
heaps that do not use cascading cuts, and thus our lower bound applies to the natural pointer 
model variant of them) . This technicality caused by the cascading cuts places them outside of 
our model. Fredman's lower bound also would not be able to deal with the cascading cuts, but 
his sequence of operations used in the lower bound only performs Decrease-Key operations 
on roots and children of the the roots, which prevents Fibonacci heaps from ever invoking 
the cascading cut. Although our Decrease-Key operations performed as part of the permu- 
tation evolution also are only performed on roots or children of roots, the Decrease-Key 
operations which are performed on marked nodes as part of processing the violation list could 
cause cascading cuts to be performed. We believe that our bound could be extended to allow 
the algorithm to unlink arbitrary nodes from their parents to cover situations like cascading 
cuts. 

• Our sort heap has a 0(lognloglogra)-time Extract-Min and O(loglogn) Decrease-Key. 
Is there a pure heap model heap with no augmented data with O(logn) amortized time 
Extract-Min and O(loglogn) Decrease-Key? We still do not know whether or not 
pairing heaps are such a heap. 
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